What a Transfer-Based System Brings to the Combination with PBMT
Abstract
We present a thorough analysis of a combination of a statistical system (Moses) and a transfer-based system (TectoMT) for English→Czech translation. We describe several techniques for inspecting such a system combination, based on both automatic and manual evaluation. While TectoMT often produces bad translations, Moses is still able to select the good parts of them. In many cases, TectoMT provides useful novel translations that are otherwise simply unavailable to the statistical component, despite the very large training data. Our analyses confirm the expected behaviour that TectoMT helps preserve grammatical agreement and valency requirements, and show that it also improves a very diverse set of other phenomena. Interestingly, including the outputs of the transfer-based system in the phrase-based search seems to have a positive effect on the search space. Overall, we find that the components of this combination are complementary and that the final system produces significantly better translations than either component by itself.